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Version with markings to show changes made 

In the specification: 

Paragraph beginning on page 4, line 13 has been amended as follows: 

Thus, provided herein is a zinc finger protein that binds to a target site, said zinc finger 
protein comprising a first (Fl), a second (F2), and a third (F3) zinc finger, ordered Fl, F2, F3 from 
N-terminus to C-terminus, said target site comprising, in 3' to 5' direction, a first (S 1), a second (S2), 
and a third (S3) target subsite, each target subsite having the nucleotide sequence GNN, wherein if 
SI comprises GAA, Fl comprises the amino acid sequence QRSNLVR (SEP ID NO: 158V if S2 
comprises GAA, F2 comprises the amino acid sequence QSGNLAR (SEP ID NP:801V if S3 
comprises GAA, F3 comprises the amino acid sequence QSGNLAR (SEP ID NP:801) : if SI 
comprises GAG, Fl comprises the amino acid sequence RSDNLAR (SEP ID NO: 130V if S2 
comprises GAG, F2 comprises the amino acid sequence RSDNLAR (SEP ID NO: 130V if S3 
comprises GAG, F3 comprises the amino acid sequence RSDNLTR (SEP ID NP:231V if SI 
comprises GAC, Fl comprises the amino acid sequence DRSNLTR (SEP ID NP:395V if S2 
comprises GAC, F2 comprises the amino acid sequence DRSNLTR (SEP ID NO: 3 95) : if S3 
comprises GAC, F3 comprises the amino acid sequence DRSNLTR (SEP ID NP:395V if SI 
comprises GAT, Fl comprises the amino acid sequence QSSNLAR (SEP ID NP: 1765V if S2 
comprises GAT, F2 comprises the amino acid sequence TSGNLVR (SEP ID NP:1442) : if S3 
comprises GAT, F3 comprises the amino acid sequence TSANLSR (SEP ID NO: 3 77V if SI 
comprises GGA, Fl comprises the amino acid sequence QSGHLAR (SEP ID NP:413V if S2 
comprises GGA, F2 comprises the amino acid sequence QSGHLQR (SEP ID NP:287V if S3 
comprises GGA, F3 comprises the amino acid sequence QSGHLQR (SEP ID NP:287V if SI 
comprises GGG, Fl comprises the amino acid sequence RSDHLAR (SEP ID NP:127) : if S2 
comprises GGG, F2 comprises the amino acid sequence RSDHLSR (SEP ID NP:229V if S3 
comprises GGG, F3 comprises the amino acid sequence RSDHLSR (SEP ID NP:229V if SI 
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comprises GGC, Fl comprises the amino acid sequence DRSHLRT (SEP ID NO: 1506) ; if S2 
comprises GGC, F2 comprises the amino acid sequence DRSHLAR (SEP ID NO: 1092) ; if SI 
comprises GGT, Fl comprises the amino acid sequence QSSHLTR (SEP ID NG:835) ; if S2 
comprises GGT, F2 comprises the amino acid sequence TSGHLSR (SEP ID NP: 1201) ; if S3 
comprises GGT, F3 comprises the amino acid sequence TSGHLVR (SEP ID NP:1425) ; if SI 
comprises GCA, Fl comprises the amino acid sequence QSGSLTR (SEP ID NP:342) ; if S2 
comprises GCA, F2 comprises OSGDLTR (SEP ID NP:220) ; if S3 comprises GCA, F3 comprises 
QSGDLTR (SEP ID NP:220) ; if SI comprises GCG, Fl comprises the amino acid sequence 
RSDDLTR (SEP ID NP:188) ; if S2 comprises GCG, F2 comprises the amino acid sequence 
RSDDLQR (SEP ID NP:1844) ; if S3 comprises GCG, F3 comprises the amino acid sequence 
RSDDLTR (SEP ID NP:188) ; if SI comprises GCC, Fl comprises the amino acid sequence 
ERGTLAR (SEP ID NP:131) ; if S2 comprises GCC, F2 comprises the amino acid sequence 
DRSDLTR (SEP ID NP:417) ; if S3 comprises GCC, F3 comprises the amino acid sequence 
DRSDLTR (SEP ID NP:417) ; if SI comprises GCT, Fl comprises the amino acid sequence 
QSSDLTR (SEP ID NP:1450) ; if S2 comprises GCT, F2 comprises the amino acid sequence 
QSSDLTR (SEP ID NP:1450) ; if S3 comprises GCT, F3 comprises the amino acid sequence 
QSSDLQR (SEP ID NP:132) ; if SI comprises GTA, Fl comprises the amino acid sequence 
QSGALTR (SEP ID NP:1398) ; if S2 comprises GTA, F2 comprises the amino acid sequence 
QSGALAR (SEP ID NP:3339) ; if SI comprises GTG, Fl comprises the amino acid sequence 
RSDALTR (SEP ID NP:153) ; if S2 comprises GTG, F2 comprises the amino acid sequence 
RSDALSR (SEP ID NP:237) ; if S3 comprises GTG, F3 comprises the amino acid sequence 
RSDALTR (SEP ID NP:153) : if SI comprises GTC, Fl comprises the amino acid sequence 
DRSALAR (SEP ID NP:184) ; if S2 comprises GTC, F2 comprises the amino acid sequence 
DRSALAR (SEP ID NP:184) : and if S3 comprises GTC, F3 comprises the amino acid sequence 
DRSALAR (SEP ID NP:184) . 



20 



Atty DktNo. 8325-0011.21 
S11-US3 
09/990,186 

Paragraph beginning on page 5, line 23 has been amended as follows: 

Also provided are methods of designing a zinc finger protein comprising a first (Fl), a 
second (F2), and a third (F3) zinc finger, ordered Fl, F2, F3 from N-terminus to C-terminus that 
binds to a target site comprising, in 3' to 5 1 direction, a first (SI), a second (S2), and a third (S3) 
target subsite, each target subsite having the nucleotide sequence GNN, the method comprising the 
steps of (a) selecting the Fl zinc finger such that it binds to the SI target subsite, wherein if SI 
comprises GAA, Fl comprises the amino acid sequence QRSNLVR (SEP ID NO: 158) : if SI 
comprises GAG, Fl comprises the amino acid sequence RSDNLAR (SEP ID NO: 130) ; if SI 
comprises GAC, Fl comprises the amino acid sequence DRSNLTR (SEP ID NG:395) ; if SI 
comprises GAT, Fl comprises the amino acid sequence QSSNLAR (SEP ID NO: 1765) ; if SI 
comprises GGA, Fl comprises the amino acid sequence QSGHLAR (SEP ID NO:413) ; if SI 
comprises GGG, Fl comprises the amino acid sequence RSDHLAR (SEP ID NP:127) : if SI 
comprises GGC, Fl comprises the amino acid sequence DRSHLRT (SEP ID NP:1506) ; if SI 
comprises GGT, Fl comprises the amino acid sequence QSSHLTR (SEP ID NP:835) : if SI 
comprises GCA, Fl comprises GSGSLTR (SEP ID NP:342) : if SI comprises GCG, Fl comprises 
RSDDLTR (SEP ID NP: 1 88) ; if S2 comprises GCG, F2 comprises RSDDLPR (SEP ID NP: 1 844) ; 
if SI comprises GCC, Fl comprises ERGTLAR (SEP ID NP:131) ; if SI comprises GCT, Fl 
comprises the amino acid sequence QSSDLTR (SEP ID NP:1450) ; if SI comprises GTA, Fl 
comprises the amino acid sequence QSGALTR (SEP ID NP:1398) ; if SI comprises GTG, Fl 
comprises the amino acid sequence RSDALTR (SEP ID NP:153) ; if SI comprises GTC, Fl 
comprises the amino acid sequence DRS ALAR (SEP ID NP:184) ; (b) selecting the F2 zinc finger 
such that it binds to the S2 target subsite, wherein S2 comprises GAA, F2 comprises the amino acid 
sequence QSGNLAR (SEP ID NP:801) : if S2 comprises GAG, F2 comprises the amino acid 
sequence RSDNLAR (SEP ID NP:130) ; if S2 comprises GAC, F2 comprises the amino acid 
sequence DRSNLTR (SEP ID NP:395) : if S2 comprises GAT, F2 comprises the amino acid 
sequence TSGNLVR (SEP ID NP:1442) ; if S2 comprises GGA, F2 comprises the amino acid 
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sequence QSGHLQR (SEQ ID NP:287); if S2 comprises GGG, F2 comprises the amino acid 
sequence RSDHLSR (SEQ ID NP:229); if S2 comprises GGC, F2 comprises the amino acid 
sequence DRSHLAR (SEQ ID NO: 1092); if S2 comprises GGT, F2 comprises the amino acid 
sequence TSGHLSR (SEQ ID NO: 1201); if S2 comprises GCA, F2 comprises the amino acid 
sequence QSGDLTR (SEQ ID NO:220); if S2 comprises GCC, F2 comprises the amino acid 
sequence DRSDLTR (SEQ ID NO :4 17); if S2 comprises GCT, F2 comprises the amino acid 
sequence QSSDLTR (SEQ ID NO: 1450); if S2 comprises GTA, F2 comprises the amino acid 
sequence QSGALAR (SEQ ID NP:3339); if S2 comprises GTG, F2 comprises the amino acid 
sequence RSDALSR (SEQ ID NP:237); if S2 comprises GTC, F2 comprises the amino acid 
sequence DRS ALAR (SEQ ID NO: 1 84); and (c) selecting the F3 zinc finger such that it binds to the 
S3 target subsite, wherein if S3 comprises GAA, F3 comprises the amino acid sequence QSGNLAR 
(SEQIDNO:801) ; if S3 comprises GAG, F3 comprises the amino acid sequence RSDNLTR (SEP 
ID NO:231) ; if S3 comprises GAC, F3 comprises the amino acid sequence DRSNLTR (SEP ID 
NO:395 ) i if S3 comprises GAT, F3 comprises the amino acid sequence TSANLSR (SEP ID 
NO:377) ; if S3 comprises GGA, F3 comprises the amino acid sequence QSGHLQR (SEP ID 
N0287); if S3 comprises GGG, F3 comprises RSDHLSR (SEP ID NP:229V if S3 comprises GGT, 
F3 comprises the amino acid sequence TSGHLVR (SEP ID NO: 1 425) - if S3 comprises GCA, F3 
comprises the amino acid sequence QSGDLTR (SEP ID NO:220V if S3 comprises GCG, F3 
comprises the amino acid sequence RSDDLTR (SEP ID NP: 188V if S3 comprises GCC, F3 
comprises the amino acid sequence DRSDLTR (SEP ID NP:417V if S3 comprises GCT, F3 
comprises the amino acid sequence QSSDLQR (SEP ID NO: 132) - if S3 comprises GTG, F3 
comprises RSDALTR (SEQ ID NP: 153); and if S3 comprises GTC, F3 comprises the amino acid 
sequence DRS ALAR (SEP ID NP: 184) : 

thereby designing a zinc finger protein that binds to a target site. 



Paragraph beginning on page 7, line 5 has been amended as follows: 
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In certain embodiments of the zinc finger proteins and methods described herein, SI 
comprises GAA and F 1 comprises the amino acid sequence QRSNLVR (SEP ID NO: 158V In other 
embodiments, S2 comprises GAA and F2 comprises the amino acid sequence OSGNLAR (SEP ID 
NG:801) . In other embodiments, S3 comprises GAA and F3 comprises the amino acid sequence 
QSGNLAR (SEP ID NP:801) . In other embodiments, SI comprises GAG and Fl comprises the 
amino acid sequence RSDNLAR (SEP ID NP: 130) . In other embodiments, S2 comprises GAG and 
F2 comprises the amino acid sequence RSDNLAR (SEP ID NP: 1 30V In other embodiments, S3 
comprises GAG and F3 comprises the amino acid sequence RSDNLTR (SEP ID NP:23 1) . In other 
embodiments, SI comprises GAC and Fl comprises the amino acid sequence DRSNLTR (SEP ID 
NP:395) . In other embodiments, S2 comprises GAC and F2 comprises the amino acid sequence 
DRSNLTR (SEP ID NP:395) . In other embodiments, S3 comprises GAC and F3 comprises the 
amino acid sequence DRSNLTR (SEP ID NP:395) . In other embodiments, S 1 comprises GAT and 
Fl comprises the amino acid sequence GSSNLAR (SEP ID NP:1765) . In other embodiments, S2 
comprises GAT and F2 comprises the amino acid sequence TSGNLVR (SEP ID NP: 1 442) . In other 
embodiments, S3 comprises GAT and F3 comprises the amino acid sequence TSANLSR (SEP ID 
NP:377) . In other embodiments, SI comprises GGA and Fl comprises the amino acid sequence 
QSGHLAR (SEP ID NP:413) . In other embodiments, S2 comprises GGA and F2 comprises the 
amino acid sequence PSGHLPR (SEP ID NP:287) . In other embodiments, S3 comprises GGA and 
F3 comprises the amino acid sequence PSGHLPR (SEP ID NP:287) . In other embodiments, SI 
comprises GGG and Fl comprises the amino acid sequence RSDHLAR (SEP ID NP : 1 27) . In other 
embodiments, S2 comprises GGG and F2 comprises the amino acid sequence RSDHLSR (SEP ID 
NP:229) . In other embodiments, S3 comprises GGG and F3 comprises the amino acid sequence 
RSDHLSR (SEP ID NP:229) . In other embodiments, SI comprises GGC and Fl comprises the 
amino acid sequence DRSHLTR (SEG IDNP:705) . In other embodiments, S2 comprises GGC and 
F2 comprises the amino acid sequence DRSHLAR (SEP ID NP: 1092) . In other embodiments, SI 
comprises GGT and Fl comprises the amino acid sequence PSSHLTR (SEP ID NP:83 5) . In other 
embodiments, S2 comprises GGT and F2 comprises the amino acid sequence TSGHLSR (SEP ID 
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NP:1201V In other embodiments, S3 comprises GGT and F3 comprises the amino acid sequence 
TSGHLVR (SEP ID NO: 1425) . In other embodiments, SI comprises GCA and Fl comprises the 
amino acid sequence OSGSLTR (SEG ID NP:342V In other embodiments, S2 comprises GCA and 
F2 comprises the amino acid sequence OSGDLTR (SEP ID NP:220V In other embodiments, S3 
comprises GCA and F3 comprises the amino acid sequence OSGDLTR (SEP ID NP:220V In other 
embodiments, SI comprises GCG and Fl comprises the amino acid sequence RSDDLTR (SEP ID 
NP:188V In other embodiments, S2 comprises GCG and F2 comprises the amino acid sequence 
RSDDLQR (SEP ID NP: 1844V In other embodiments, S3 comprises GCG and F3 comprises the 
amino acid sequence RSDDLTR ( SEP ID NP: 1 88V In other embodiments, SI comprises GCC and 
Fl comprises the amino acid sequence ERGTLAR (SEP ID NP:131V In other embodiments, S2 
comprises GCC and F2 comprises the amino acid sequence DRSDLTR (SEP IDNP:417V In other 
embodiments, S3 comprises GCC and F3 comprises the amino acid sequence DRSDLTR (SEP ID 
NP:417V In other embodiments, SI comprises GCT and Fl comprises the amino acid sequence 
OSSDLTR (SEP ID NP: 1450V In other embodiments, S2 comprises GCT and F2 comprises the 
amino acid sequence OSSDLTR ( SEP ID NP: 1 450V In other embodiments, S3 comprises GCT and 
F3 comprises the amino acid sequence PSSDLPR (SEP ID NO: 132V In other embodiments, SI 
comprises GTA and Fl comprises the amino acid sequence OSGALTR (SEO ID NO: 1 398V In other 
embodiments, S2 comprises GTA and F2 comprises the amino acid sequence OSGALAR (SEP ID 
NP:3339V In other embodiments, SI comprises GTG and Fl comprises the amino acid sequence 
RSDALTR (SEP ID NP:153V In other embodiments, S2 comprises GTG and F2 comprises the 
amino acid sequence RSDALSR (SEP ID NP : 23 7V In other embodiments, S3 comprises GTG and 
F3 comprises the amino acid sequence RSDALTR (SEP ID NP:153V In other embodiments, SI 
comprises GTC and Fl comprises the amino acid sequence DRSALAR (SEP ID NP: 1 84) . In other 
embodiments, S2 comprises GTC and F2 comprises the amino acid sequence DRSALAR (SEP ID 
NP:184V In other embodiments, S3 comprises GTC and F3 comprises the amino acid sequence 
DRSALAR (SEP ID NP: 184V 
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Paragraph beginning on page 9, line 10 has been amended as follows: 

Figure 1 shows results of site selection analysis of two representative zinc finger proteins 
(leftmost 4 columns) and measurements of binding affinity for each of these proteins to their 
intended target sequences and to variant target sequences, (rightmost 3 columns). Analysis of ZFP1 
is shown in the upper portion of the figure and analysis of ZFP2 is shown in the lower portion of the 
figure. For the site selection analyses, the amino acid sequences of residues -1 through +6 of the 
recognition helix of each of the three component zinc fingers (F3 (ZFP1. SEP ID NO: 1 30; ZFP7 
S EP ID NP:420) , F2 (ZFP1. SEP ID NO: 1051: ZFP2. SEP TP NP:889^ and F1 (ZFP1. SEP TP 
NP:395; ZFP2, SEP ID NP:685) ) are shown across the top row; the intended target sequence 
(divided into finger-specific target subsites) is shown across the second row, and a summary of the 
sequences bound is shown in the third row. Data for F3 is shown in the second column, data for F2 
is shown in the third column, and data for Fl is shown in the third column. 



Paragraph beginning on page 9, line 24 has been amended as follows: 

Figure 2 shows amino acid sequences of zinc finger recognition regions (amino acids -1 
through +6 of the recognition helix) that bind to each of the 16 GNN triplet subsites. Three amino 
acid sequences are shown for each trinucleotide subsite; these correspond to optimal amino acid 
sequences for recognition of the subsite from each of the three positions (finger 1, Fl fSEP ID 
NPS:6 88,2534, 676, 1769, 342. 1450. 131. 158. 1765. 395. 1407. 2644. 705. 1398. 1733* 184) - 
finger 2, F2 (SEP ID NPS:688. 229. 1446. 3051. 220. 1450. 417. 801 . 1442. 395. 1824. 1 201 . 979 
3339, 1151 & 184); or finger 3, F3 (SEPIDNPS:943.229. 676. 1769.220. 1365.417. 801.3525. 
395, 1824, 1 425, 972, 3592, 952 & 1 84)) in a three-finger zinc finger protein. Amino acid sequences 
are from N-terminal to C-terminal; nucleotide sequences are from 5' to 3'. 



Paragraph beginning on page 10, line 1 has been amended as follows: 
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Also shown are site selection results for each of the 48 position-dependent GNN-recognizing 
zinc fingers. These show the number of times a particular nucleotide was present, at a given 
position, in a collection of oligonucleotide sequences bound by the finger. For example, out of 15 
oligonucleotides bound by a zinc finger protein with the amino acid sequence QSGHLAR (SEP ID 
NO:413) present at the finger 1 (F 1 ) position, 1 5 contained a G in the 5 ' -most position of the subsite, 
15 contained a G in the middle position of the subsite, while, at the 3 '-most position of the subsite, 
10 contained an A, 3 contained a G and 2 contained a T. Accordingly, this particular amino acid 
sequence is optimal for binding a GGA triplet from the Fl position. 

Paragraph beginning on page 10, line 10 has been amended as follows: 

Figures 3 A, 3B and 3C show site selection data indicating positional dependence of GCA-, 
GAT- and GGT-binding zinc fingers. The first and fourth (where applicable) rows of each figure 
show portions of the amino acid sequence of a designed zinc finger protein (Fl column. SEP ID 
NOS:220. 1765. 1442. 835 & 1425: F2 column, SEP ID NOS:220. 1765, 1442. 889. 1425: F3 
column. SEP ID NPS:220. 159. 377. 889 & 1425V Amino acid residues-1 through +6 of each a- 
helix are listed from left to right. The second and fifth (where applicable) rows show the target 
sequence, divided into three triplet subsites, one for each finger of the protein shown in the first and 
fourth (where applicable) rows, respectively. The third and sixth (where applicable) rows show the 
distribution of nucleotides in the oligonucleotides obtained by site selection with the proteins shown 
in the first and fourth (where applicable) rows, respectively. Figure 3A shows data for fingers 
designed to bind GCA; Figure 3B shows data for fingers designed to bind GAT; Figure 3C shows 
data for fingers designed to bind GGT. 

Paragraph beginning on page 10, line 21 has been amended as follows: 

Figures 4A and 4B show properties of the engineered ZFP EP2C. Figure 4A shows site 
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selection data. The first row provides the amino acid sequences (F3. SEP ID NO: 1100: F2. SEP 
ID NO:237;FL SEP ID NP:1450) of residues -1 through +6 of the recognition helices for each of 
the three zinc fingers of the EP2C protein. The second row shows the target sequence (5 s to 3'); 
with the distribution of nucleotides in the oligonucleotides obtained by site selection indicated below 
the target sequence. 

Paragraph beginning on page 13, line 19 has been amended as follows: 

A D-able subsite within a target site has the motif 5 'NNGK3 ' (SEP ID NP :4084V A target 
site containing one or more such motifs is sometimes described as a D-able target site. A zinc finger 
appropriately designed to bind to a D-able subsite is sometimes referred to as a D-able finger. 
Likewise a zinc finger protein containing at least one finger designed or selected to bind to a target 
site including at least one D-able subsite is sometimes referred to as a D-able zinc finger protein. 

Paragraph beginning on page 14, line 16 has been amended as follows: 

Table 6 lists a collection of consensus sequences for zinc fingers and the target sites bound 
by such sequences. Conventional one letter amino acid codes are used to designate amino acids 
occupying consensus positions. The symbol "X" designates a nonconsensus position that can in 
principle be occupied by any amino acid. In most zinc fingers of the C 2 H 2 type, binding specificity 
is principally conferred by residues -1, +2, +3 and +6. Accordingly, consensus sequence 
determining binding specificity typically include at least these residues. Consensus sequences are 
useful for designing zinc fingers to bind to a given target sequence. Residues occupying other 
positions can be selected based on sequences in Tables 1-5, or other known zinc finger sequences. 
Alternatively, these positions can be randomized with a plurality of candidate amino acids and 
screened against one or more target sequences to refine binding specificity or improve binding 
specificity. In general, the same consensus sequence can be used for design of a zinc finger 
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regardless of the relative position of that finger in a multi-finger zinc finger protein. For example, 
the sequence RXDNXXR (SEP ID NO:406(» can be used to design a N-terminal, central or C- 
terminal finger of three finger protein. However, some consensus sequences are most suitable for 
designing a zinc finger to occupy a particular position in a multi-finger protein. For example, the 
consensus sequence RXDHXXQ (SEP ID NQ:4055^ is most suitable for designing a C-terminal 
finger of a three-finger protein. 

Paragraph beginning on page 32, line 23 has been amended as follows: 

The amino acid sequence RSDXLXR (SEP ID NP:4085) (position -1 to +6 of the 
recognition helix) was found to be optimal for binding to the four GNG triplets, with Asn +3 
specifying A as the middle nucleotide; His +3 specifying G as the middle nucleotide; Ala +3 specifying 
T as the middle nucleotide; and Asp +3 specifying cytosine as the middle nucleotide. At the +5 
position, Ala, Thr, Ser, and Gin, were tested, and all showed similar specificity profiles by site 
selection. Interestingly, and in contrast to a previous report (Swirnoff et al. (1995) Mol. Cell. Biol. 
15:2275-2287), site selection results indicated that three naturally-occurring GCG-binding fingers 
from zif268 and Spl, having the amino acid sequences RSDELTR (SEP ID NP: 123V RSDELQR 
(SEQ ID NP:302), and RSDERKR (SEP ID NP:1 100V were not GCG-specific. Rather, each of 
these fingers selected almost equal numbers of GCG and GTG sequences. Analysis of binding 
affinity by gel-shift experiments confirmed that finger 3 of zif268, having the sequence RSDERKR 
(SEP ID NP: 1100V binds GCG and GTG with approximately equal affinity. 

Paragraph beginning on page 33, line 5 has been amended as follows: 

Based on existing design rules, the amino acid sequence QSGDLTR (SEP ID NP:220^ (-1 
through +6) was tested for its ability to bind the GCA triplet from three positions (Fl, F2, and F3) 
within a three-finger ZFP. Figure 3 A shows that the QSGDLTR (SEP ID NP:220 > > sequence bound 
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preferentially to the GCA triplet subsite from the F2 and F3 positions, but not from Fl . In fact, the 
presence of QSGDLTR (SEP ID NO:220) at the Fl position of three different three-finger ZFPs 
resulted predominantly in selection of GCT. Accordingly, an attempt was made to redesign this 
sequence to obtain specificity for GCA from the F 1 position. Since the sequence Q" 1 G +2 S +3 R +6 (SEP 
IDNG:4065) had previously been selected from a randomized F 1 library using GCA as target (Rebar 
et al (1994) Science 263:671-673), a D (asp) to S (ser) change was made at the +3 residue of this 
finger. The resulting sequence, QSGSLTR (SEP ID NO: 3421 . was tested for its binding specificity 
by site selection and found to preferentially bind GCA, from the Fl position, in three different ZFPs 
(see Figure 2). 

Paragraph beginning on page 33, line 17 has been amended as follows: 

The GSGSLTR (SEP ID NO: 3421 zinc finger, optimized for recognition of the GCA subsite 
from the Fl position, was tested for its selectivity when located at the F2 position. Accordingly, two 
ZFPs, one containing QSGSLTR fSEPIDNP:3421 at finger 2 and one containing QSGDLTR {SEQ 
ID NG^O) at finger 2 (both having identical Fl sequences and identical F3 sequences) were tested 
by site selection. The results indicated that, when used at the F2 position, QSGSLTR (SEP ID 
NP:342) bound preferentially to GTA, rather than GCA. Thus, for optimal binding of a GCA triplet 
subsite from the Fl position, the amino acid sequence QSGSLTR (SEP ID NP:3421 is required; 
while, for optimal binding of the same subsite sequence from F2 or F3, QSGDLTR (SEP ID 
NP:220) should be used. Accordingly, different zinc finger amino acid sequences may be needed 
to specify a particular triplet subsite sequence, depending upon the location of the subsite within the 
target sequence and, hence, upon the position of the finger in the protein. 

Paragraph beginning on page 33, line 29 has been amended as follows: 

Positional effects were also observed for zinc fingers recognizing GAT and GGT subsites. 
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The zinc finger amino acid sequence QSSNLAR (SEP ID NO: 1 7651 (-1 through +6) is expected to 
bind to GAT, based on design rules. However, this sequence selected GAT only from the Fl 
position, and not from the F2 and F3 positions, from which the sequence GAA was preferentially 
bound (Figure 3B). Similarly, the amino acid sequence QSSHLTR (SEP ID NP:8351 which, based 
on design rules, should bind GGT, selected GGT at the Fl position, but not at the F2 and F3 
positions, from which it preferentially bound GGA (Figure 3C). Conversely, the amino acid 
sequence TSGHLVR (SEP ID NP: 14251 has previously been disclosed to recognize the triplet 
GGT, based on its selection from a randomized library of zif268 finger 2. U.S. Patent No. 
6,140,081 . However, TSGHLVR (SEP ID NP: 14251 was not specific for the GGT subsite when 
located at the Fl position (Figure 3C). These results indicate that the binding specificity of many 
fingers is position dependent, and particularly point out that the sequence specificity of a zinc finger 
selected from a F2 library may be positionally limited. 

Table on page 66 has been amended as follows: 



TABLE 6 





FINGER (N^C) 


TRIPLET 


Fl 


SEO ID 

NO: 


F2 


SEO ID 
NO: 


F3 


SEO ID 
NO: 


AGG 










RXDHXXQ 


4055 


ATG 










RXDAXXQ 


4056 


CGG 










RXDHXXE 


4057 


GAA 






QXGNXXR 


4058 






GAC 


DXSNXXR 


4059 






DXSNXXR 


4059 


GAG 


RXDNXXR 


4060 


RXSNXXR 
RXDNXXR 


4061 
4060 


RXDNXXR 


4060 


GAT 


QXSNXXR 
TXSNXXR 
TXGNXXR 


4062 
4063 
4064 


TXGNXXR 


4064 






GCA 


QXGSXXR 


4065 


QXGDXXR 


4066 
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GCC 


EXGTXXR 


4067 










GCG 


RXDEXXR 


4068 


RXDEXXR 


4068 


RXDEXXR 

P YHTYYF 


4068 
4Uoy 


GCT 


QXSDXXR 


4070 


TXGEXXR 


4071 






GGA 






QXGHXXR 


4072 


QXAHXXR 


4073 


GGC 


DXSHXXR 


4074 


DXSHXXR 


4074 






GGG 


RXDHXXR 


4075 


RXDHXXR 


4075 


RXDHXXR 
RXDHXXK 


4075 
4076 


GGT 










TXGHXXR 


4077 


GTA 






QXGSXXR 
OXATXXR 


4065 
4078 






GTG 


RXDAXXR 
RXDSXXR 


4079 
4080 


RXDAXXR 


4079 


RXDAXXR 


4079 


TAG 






RXDNXXT 


4081 






TCG 


RXDDXXK 


4082 










TGT 






TXDHXXS 


4083 







In the claim: 

1 . (Amended) A method for designing a zinc finger protein comprising a first (Fl), a second 
(F2), and a third (F3) zinc finger, ordered Fl, F2, F3 from N-terminus to C-terminus that binds to 
a target site comprising, in 3' to 5' direction, a first (SI), a second (S2), and a third (S3) target 
subsite, each target subsite having the nucleotide sequence GNN, the method comprising the steps 
of 

selecting the F 1 zinc finger such that it binds to the S 1 target subsite, wherein if S 1 comprises 
GAA, Fl comprises the amino acid sequence QRSNLVR ( SEP ID NO: 1 58V if SI comprises GAG, 
Fl comprises the amino acid sequence RSDNLAR (SEP ID NO^OV if SI comprises GAC, Fl 
comprises the amino acid sequence DRSNLTR (SEP ID NP:395V if SI comprises GAT, Fl 
comprises the amino acid sequence QSSNLAR (SEP ID NP: 1765V if SI comprises GGA, Fl 
comprises the amino acid sequence QSGHLAR (SEP ID NP:413V if SI comprises GGG, Fl 
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comprises the amino acid sequence RSDHLAR (SEP ID NO: 1271 : if SI comprises GGC, Fl 
comprises the amino acid sequence [DRSHLRT] DRSHLTR TSEO ID NO: 1 506V if SI comprises 
GGT, Fl comprises the amino acid sequence QSSHLTR (SEP ID NP:835V if SI comprises GCA, 
Fl comprises OSGSLTR (SEP ID NO:342V if SI comprises GCG, Fl comprises RSDDLTR (SEP 
IDNP:188) ; [if S2 comprises GCG, F2 comprises RSDDLQR;] if SI comprises GCC, Fl comprises 
ERGTLAR (SEP ID NO: 131); if SI comprises GCT, Fl comprises the amino acid sequence 
QSSDLTR (SEQ ID NO: 1450); if SI comprises GTA, Fl comprises the amino acid sequence 
QSGALTR (SEQ ID NO: 1398); if SI comprises GTG, Fl comprises the amino acid sequence 
RSDALTR (SEQ ID NO: 153); if SI comprises GTC, Fl comprises the amino acid sequence 
DRS ALAR fSEPIDNP:184V 

selecting the F2 zinc finger such that it binds to the S2 target subsite, wherein S2 comprises 
GAA, F2 comprises the amino acid sequence QSGNLAR (SEP ID NP:801 V if S2 comprises GAG, 
F2 comprises the amino acid sequence RSDNLAR (SEP ID NP:130V if S2 comprises GAC, F2 
comprises the amino acid sequence DRSNLTR (SEP ID NP:395V if S2 comprises GAT, F2 
comprises the amino acid sequence TSGNLVR (SEP ID NO: 1442V if S2 comprises GGA, F2 
comprises the amino acid sequence QSGHLQR (SEP ID NP:287V if S2 comprises GGG, F2 
comprises the amino acid sequence RSDHLSR fSEG ID NP:229V if S2 comprises GGC, F2 
comprises the amino acid sequence DRSHLAR (SEP ID NO: 1092V if S2 comprises GGT, F2 
comprises the amino acid sequence TSGHLSR (SEP ID NO: 1201V if S2 comprises GCA, F2 
comprises the amino acid sequence QSGDLTR (SEP ID NP:220V if S2 comprises GCG. F2 
comprises the amino acid sequence RS DDLPR rSRQ ID NP: 1844V if S2 comprises GCC, F2 
comprises the amino acid sequence DRSDLTR (SEP ID NP:417V if S2 comprises GCT, F2 
comprises the amino acid sequence QSSDLTR (SEP ID NP: 1450V if S2 comprises GTA, F2 
comprises the amino acid sequence QSGALAR (SEP ID NP:3339V if S2 comprises GTG, F2 
comprises the amino acid sequence RSDALSR (SEP ID NP:237V if S2 comprises GTC, F2 
comprises the amino acid sequence DRS ALAR (SEP ID NP: 1 84V and 
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selecting the F3 zinc finger such that it binds to the S3 target subsite, wherein if S3 comprises 
GAA, F3 comprises the amino acid sequence QSGNLAR (SEP ID NO:80n : if S3 comprises GAG, 
F3 comprises the amino acid sequence RSDNLTR (SEP ID NP:2311 : if S3 comprises GAC, F3 
comprises the amino acid sequence DRSNLTR (SEP ID NP:3951 : if S3 comprises GAT, F3 
comprises the amino acid sequence TSANLSR (SEP ID NP :377V if S3 comprises GGA, F3 
comprises the amino acid sequence QSGHLQR (SEP ID NP:2871 : if S3 comprises GGG, F3 
comprises RSDHLSR (SEP ID NP:229V if S3 comprises GGT, F3 comprises the amino acid 
sequence TSGHLVR (SEP ID NP: 14251 : if S3 comprises GCA, F3 comprises the amino acid 
sequence QSGDLTR (SEP ID NP:2201 : if S3 comprises GCG, F3 comprises the amino acid 
sequence RSDDLTR (SEP ID NO: 1881 : if S3 comprises GCC, F3 comprises the amino acid 
sequence DRSDLTR (SEP ID NP:4171 : if S3 comprises GCT, F3 comprises the amino acid 
sequence QSSDLQR (SEP ID NP: 1321 : if S3 comprises GTG, F3 comprises RSDALTR (SEP ID 
NO:153) : and if S3 comprises GTC, F3 comprises the amino acid sequence DRSALAR (SEP ID 
NP:1841 : 

thereby designing a zinc finger protein that binds to a target site. 
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